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ABSTRACT 


Design of CHCuits is earned out in a hieiuchnl minnci with vciification done 
thiough simulations at each level of the hieiaichy The liming simulatois aie used loi 
Imdiiig out the delay of the designed cneuUs at the clcctiie il level Simiil itois like SPICL 
give veiy accurate infoimation regarding the delays but they aie not useful for simulating 
veiy laige cucuits as they consume lot of time Many simple models have been pioposed 
to develop fast liming simulatois but they have lesulted in loss ol accuiacy owing to the 
simplified models th it they use Reseiich is going on to atl iin it consideiably iccui ite is 
well as cilicient models 

One such model that has been pioposed w is studied ind the leison foi the loss ol 
accuracy pointed out The complete modeling has been redone with suitable coirections 
applied and the gam in accuracy with a very minute loss in ellieicncy is demonstnted 
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CHAPIER 1 

INTRODUCTION 


Design of cncuils has become a complex task with the sUle ot the ait chips 
containing millions of tiansistois Design of ciicuits is earned out in i hiei uchial m innci 
with Ihc design lust done U llie bell ivioi il level ind llieii li iiisl itcd down to the linil 
physical level step by step Veiification of the design is necessity so th it the final chip 
lunclions piopcily II tins veiification is c lined onl it the liiiil physic il level ind is 
found not to match specifications then once again the ciicuit needs to be designed it the 
behnvioial level ind all the intermediate steps need to be repealed once igain So to ivoid 
this wastage of time ind effoit veiificition is c lined out it e ich step bcfoie being 
ti insl ited to the next level I his vculicalion is c lined out by using sinuil itois 

With the complexity of ciicuits incicising diy by diy the demand loi a veiy last 
yet effective and accuiate simulation is increasing The pioblem lies in thit il the 
simulatoi IS designed to be highly accuiate then due to the complexity of the models 
lequired foi such a siinul iloi the speed of the simulUion deeieiscs In oidei to icdiiec 
the lime of simulation if the models aie simplified they icstill m eiiois Ihit become 
unacceptable So many new techniques foi this simulition hive been proposed \nd 
developed to have an optimum siniulatoi 


1 1 SIMULATION 

To make the task of designing complex circuits easy i hieiarchial design ind 
veiification procedure is used In this the overall task is paititioned into moie 
manageable components using various levels of abstiaclion The diffeient levels of 
absti actions vary fiom each other by the size of the components and the iccuracy of the 
models used to depict these components This leduces the oveiall verification lime 
requiied because at the highest level (behavioial level) infoimation about the inteinil 
stiucture is missing ind only the high level function ihty of the system needs to be 
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veiitied It the ciicuit is as pei the specifications then it c in be ii insl Ued to the next level 
and now with some moie details added a moie detailed veiitic ition can be coined out 
The design tnd vciilicUion flow foi any ciicuit will be as shown in Figuic 1 1 



Figure 1 1 Design and Veiilication hieiarchy 
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IN FRODUCTION 


At the behavioral level the verification is carried out by just verifying the 
functional behavior At the RTL stage the verification is earned out by RTL simulators 
and these need to verify functional blocks like registers combinational and sequential 
blocks The RTL simulators provide more accurate information and may sometime also 
piovide crude timing information as well These are helpful for data path design At the 
logic or gate level simulators check the circuit by evaluating the voltage or logic at each 
node by using the Boolean expressions of the gates These can provide first order timing 
details by inserting delays at the end of logic evaluation The actual rise and fall delays 
are calculated by using more accurate timing simulators that generally work at the 
transistor level and consider the layout apd the technology dependent parasitics After the 
transistor level circuit is mapped to the physical level then a detailed extraction of the 
parasitics is done and the ciicuit specifications are again venfied with these new values of 
parasitics 

In each step, as the design is being mapped to the next level details are being 
added and hence the simulation at each level takes more and more time as we go down 
the design hici iichy but the accuracy increases 

ArrintArv 

A 

FLECTRICAL LEVEL 

LOGIC LEVEL 

RTL LEVEL 

BFIIAVIORAL I FVFI 

► 

SPFFD 

Figure 1 2 Levels of simulation and their respective speed of simulation 
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INTRODUC nON 


Electrical (transistor) level simulators like SPICE provide the most accurate 
inform ition on the voltage waveform at the circuit nodes and the current through the 
ciicuit elements However these are computationally very intensive and take a lot of time 
and so their usage is limited upto a few thousand transistors 

Inorder to bridge this gap between the slow but highly accurate electrical 
simulation and the fast but inadequate logic level simulation many intermediate level 
simulation levels have been incorporated like switch level logic simulation switch level 
timing simulation fast timing simulation and timing simulation All these have been 
explained m biief 


Af r lIIMf Y 

A 

ELECTRICAL LEVEL 

TIMING SIMULATION 

FAST TIMING SIMUI ATION 

SWITCH LEVEL TIMING SIMULATION 

LOGIC I FVFI 

► 

SPFFn 

Figure 1 3 Intermediate levels of simulation 

A switch level simulator is basically used to determine the steady state values of 
nodes and then to apply a delay operator if the steady state values change It considers the 
transistois to be switches, which are controlled by the voltage applied to the Gate 
tennmal These are inaccurate The timing simulation is more accurate than a switch level 
simulation but they are still slow for very large digital circuits because they use the time 
point Iteration techniques Fast timing simulations have been developed to account for 
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INTRODUCTION 


this bclvvcLH the switch level simulator and the timing simulator These have hecn 
made as accurate as the timing simulators but also fast enough for large circuits Several 
different approaches to fast timing simulation have been proposed but on a broad basis 
they have been made fast and accurate by the following features that they incorporate 
like 

1 Event driven 

2 Simplified Transistor models 

3 Use of primitive or macro models to which sub circuits are mapped 

4 Efficient use of analytical solution techniques for nodal differential equations 

5 Waveform relaxation techniques for circuits with feedback 

1 2 FAST TIMING SIMULATION 

Since its inception, fast timing simulation techniques have been vastly improved, 
and the existing timing simulators are as accurate as conventional timing simulators 
Seveial different approaches have lead to different timing simulators They can however 
be broadly classified into two basic types The first type use integration formulae to 
convcit the differential equations to difference equations and use a linear macro model to 
simplify and solve the differential equations The time steps in the simulation are usually 
small for obtaining accuracy and these are too inefficient to simulate large VLSI circuits 
Examples of this type of simulators include Elogic SPEC2 MOTIS3 MDCsim Event 
EMU and ADEPT The second type uses a nonlinear macro model or primitive and 
solves the differential equation governing node voltages analytically The non linear 
macro models provide accuracy and the analytical solutions provide the required 
efficiency These simulators have speed ups of the order of 3 magnitudes over SPICE2 
and so from the point of view of efficiency are suitable for simulating large VLSI 
circuits Examples of this category are IDSIM2 and ILL! ADS 

ILLIADS incorporates all the above mentioned five concepts to achieve 
very good efficiency along with a considerable accuracy[2] In this the circuit is 
paititioned into blocks of MOS devices resistors and capacitors called as DCCBs (DC 
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connected blocks) and directed graphs are constructed The circuit graph is then 
condensed and the order in which the blocks need to be simulated is found and put in the 
collect temporal order for simulation and then as per the event dnven methodology the 
simulation is done It uses primitives or macro models to which the circuit is mapped 
The circuit primitive used m ILLIADS is the generalization of the inverter macro model 
shown in Figure 1 4(a) The generic circuit primitive for MOS digital circuits is also 
shown in Figure 1 4(b) The primitive contains linear capacitances and resistances and 
parallel blanches of NMOS and PMOS transistors Piecewise linear voltage sources are 
applied at the gate and dram temiinals All the voltage dependent gate dram capacitances 
are lumped at the gate and drain/source nodes Also, the junction and overlap 
capacitances are lumped at the corresponding terminals of the MOS devices The 
resistive branches have been included m the circuit primitive for generality but do not 
occur m MOS digital circuits and so are excluded from simulation 


Vi)D 



(a) Inverter macro model (chargmg/dischargmg) 

Dl Dk Dk 4I Dm D 



Figure 1 4 (b) Genenc MOS circuit primitive in ILLIADS 
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INIROnUC I ION 


1 3 NODE MERGING AND PRIMITIVE MAPPING 

In a DCCB all the nodes can be categorized into four different types >esistne 
mtenuil chiven and pass Resistive nodes are those which are connected to resistors 
cipicilois and/oi gale Icimiinls of MOS transistors Internal nodes arc the ones lint 
connect the dc paths of only one type of transistor A driven node connects the dc paths 
of both NMOS and PMOS tnnsistors and has a dc path to a voltage source that contains 
no other nodes apart from internal nodes All the other nodes are classified as pass nodes 

In ILLIADS all internal nodes are eliminated by senal and parallel merging of 
tiansistors and finding an equivalent transistor and an equivalent gate signal For semi 
merging two transistois with transconductances Pi and p the transconductance of the 
cquiv ilcnt li insistoi is given by piP2/(Pi t-Pa) and the equiv dent gde signal is taken to be 
the weaker of the two gate signals Incase of parallel merging of two transistors the 
transconductance of the equivalent transistor is given by P1+P2 and the equivalent gate 
signal IS given by the stronger of the two signals The internal node elimination can be 
done m two ways One way is to directly take the complete gate nctlist and start 
eliminating the intciml nodes by the serial and parallel merging technique The other is 
to lust bitak the iictlisl into a net of NAND/NOR gales and then to replace each such 
gate by an equivalent inverter In ILLIADS the first technique is used Once the netlist 
has been 1 educed by eliminating internal nodes to the maximum extent it is mapped to 
the primitives available and the output computed 


1 4 FOCUS OF THE WORK 

In tiansistor merging the objective is to obtain the parameters of an equivalent 
transistor that reflects the I V characteristics of the serial or parallel combination In 
general this is a very difficult problem to solve analytically pnmarily owing to the 
complex nature of the MOS dram current equations and the complexity of the 
relationship between the dram current and the MOS model parameters As a result a 
number of assumptions are required to perform the merging A few of the most common 


7 


INIRODUCIION 


assumptions aic to tike the same model for all the transistors that need to be merged to 
neglect the channel length modulation and the parasitic capacitance effects and to take 
the same gate signal for all the transistors that need to be merged Though this procedure 
of finding the effective gate signal transconductance and channel length modulation 
works well for long geometry devices errors as large as 100% have been reported for 
sub micron devices [1] These errors have been found to anse in the senal merging of 
transistors with the errors being very low in parallel merging This is due to the effect of 
the internal nodes and the parasitic effects at these nodes in moderately to large number 
of serially connected transistors So it is important to take the effect of the internal nodes 
m the computation of the output waveforms 

Lot of work Ins already been done m mapping the nethst to NAND/NOR gates 
and also an efficient and accurate enough technique of mapping senal transistors to an 
equivalent invciter has been reported m [1] In this the chain of serially connected 
transistors has been replaced by an equivalent transistor and only one parameter the 
width of the equivalent transistor has been computed to match it to the chain of 
transistors Also the model used for the transistor is the alpha power law model [3] which 
takes into account the velocity saturation effect which is one of the prominent short 
channel effects and thus results in more accurate results of the output Both the 
conducting as well as the parasitic behavior of parallel and senally connected transistors 
has been considered and an equivalent transistor extracted m each case taking into 
iccount the actual operation conditions of each device The modeling has been developed 
for non zero input transition times In order to map the input signals to an equivalent 
signal an input mapping algonthm has been proposed and also the weight of each 
transistor position m the gate structure has been extracted The algorithm of first 
collapsing a complex gate to an equivalent NAND/NOR gate and then mapping it to an 
equivalent inverter has been used The starting point of conduction of the transistor chain 
has been calcuhled to much accuracy and the equivalent transistor width has been 
calculated by fitting the actual output of the transistor chain to the output of the 
equivalent inverter Due to the analytical solution approach used in the paper the 
efficiency in calculating the equivalent inverter parameters is very high and the model 
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h IS been developed with the view to model the equivalent invcitci usin^ ILLIADS 
which IS hij^hly efficient 

But m the calculation of the starting point of conductance the effect of substrate 
voltage (or the body effect) had been neglected which resulted in inaccurate values of the 
staiting point of conduction Also the results for slow ramp inputs have shown more 
ciiois (Inn for fast ramps In this approach the effective width has been calculitcd at a 
time tint lias been cilculatcd ftom the starting point of conduction but then a different 
fitting parameters have been used for slow and fast ramps to get accuracy over a wide 
range of input transition times This has resulted in more errors for slow ramps than for 
the fast ramps 

In this work the complete model for collapsing a series connected transistor chain 
to an equivalent transistor has been venfied Also dunng the analysis for finding out the 
starting point of conduction the modulation of the threshold voltage due to the body 
effect has been taken into account and hence highly accurate results for the starting point 
of conduction have been obtained These have also resulted in very accurate results in the 
final output for slow ramps as well and the need for a different fitting parameter has been 
cliinm itul 


9 


Chapter 2 

CMOS GATE MODELING 


CMOS GATE MODELING 


There are two techniques of modeling a gate The first method corresponds to the 
generalization of the inverter model and is based on a fully mathematical analysis of the 
gate structure This approach leads to a high mathematical complexity and hence less 
efficiency The second method utilizes the well established theory of inverters and 
research has been focused on the development of sophisticated methods for collapsing a 
CMOS gate into an effective equivalent inverter, whose output response will reflect 
accurately that of the gate The accuracy of this method is comparable to the former 
while the mathematical complexity is less resulting in better efficiency Several 
techniques have been developed for this second method of modeling the gate In order to 
map the gate to an equivalent inverter a delay degradation factor was introduced by 
Sakurai and Newton[5] while Nabavi Lishi and Rumin presented a semi emperical 
model[6] But all these have resulted in limited accuracy and finally for complex 
structures they have ended up in the conventional n times transconductance reduction for 
an n transistor chain 

Recently the transistor chain characteristics has been studied from a 
macromodeling point of view In [1] a method for modeling CMOS gates by an 
equivalent inverter has been proposed In this approach the key points in the behavior of 
CMOS gates are modeled analytically in order to improve the accuracy of the final 
cqiiivnluit inverter, whose transistor’s widths are calculated efficiently taking into 
account the mode of operation viz cutoff saturation and linear of the transistors in the 
gate Such key points are the starting point of gate conduction, which has a significant 
impact on the output waveform the form of the internal node voltages the parasitic 
behavior of the chain and the weight of each transistor position in the chain Also a very 
efficient algorithm for reducing each possible input pattern into an effective single 
equivalent input that can be applied to an inverter model has been proposed in this paper 



CMOS GATE MODELING 


In order to obtain the output voltage waveform at the output of a gate it needs to 
be mapped to an equivalent inverter Any complex gate is nothing but an interconnection 
of series and parallel combinations of transistors So for mapping any gate to an 
equivalent inverter seiial and parallel transistors in the gate need to be collapsed into a 
single equivalent transistor As mentioned in section 1 4 the equivalent transistor for 
parallel transistors can be directly found out by adding up their transconductance and the 
errors introduced due to such a mapping are very low But for replacing a serially 
connected chain of transistors by an equivalent one the parasitic behavior and also the 
behavior at the intemal nodes need to be considered to be able to get accurate results 
This technique as mentioned in [1] is described in the succeeding sections The input 
waveform for all the inputs of the chain has been considered to be the same and has been 
taken as a ramp input A mapping algonthm for mapping all the possible input patterns to 
an equivalent ramp input has been proposed for this m [1] 


2 1 NMOS TRANSISTOR CHAIN MODEL 

A gate can have a chain of PMOS or NMOS transistors So if the characteristics 
of one of the NMOS or PMOS chain are studied then the characteristics of the other will 
follow suit as they will be of the same nature So here a chain of NMOS transistors has 
been considered To replace a chain of transistors by an equivalent transistor the 
response of the chain was studied and the equivalent transistor chosen such that its 
response matches that of the chain To analyze the response of a chain of n NMOS 
transistors consider the circuit shown in Figure 2 1 (a), where the parasitic drain/source 
node capacitances are also shown An input ramp with transition time t is assumed to be 
applied to the gates of alt the transistors in the chain 


ElUmML LlBRWT 
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(c) (d) 

Figure 2 1 (a) Cham of n NMOS transistors (b) Two transistor equivalent model 
(c)Single equivalent transistor model (d) Currents at the i‘^ node dunng tj , ts,+i 
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CMOS GATE MODELING 


The input ramp can be expressed as 

^ 0 



where t is 


t < 0 

0 < t < T (2 1) 

t > T 

time penod of the ramp 


In order to consider higher order effects the a power law model [3] which takes 
into account the earner velocity saturation effect of short channel devices has been 
considered for the transistor currents The dram current according to this model is given 
by 


{ 0 Vgs ^Vtn cutoff region 

O^Gs - ^Ds Vds < Vd sat linear region (2 2) 

Ks(V,,~Vr,r Vds ^ Vd sat saturation region 

where Vdsaf is the dram saturation voltage Ki Ks are the transconductance 
parameters which depend on the width to length ratio of the transistor a is the carrier 
velocity saturation index and Vtn is the threshold voltage expressed by its first order 
Taylor series approximation around Vsb = 0 2Vdd as 

“ VtnI vsB »0 2 VDD + (Vtn) I vsB= 02 VDD (Vsb - 0 2 Vdd) = 0 + 5 Vsb (2 3) 

where Vsn is the source to substrate voltage i e the body voltage 

The Taylor series has been calculated around Vsn = 02 Vdd since the voltage was 
found to he close to the midpoint of the voltage swing of the source node of the top most 
transistor which is of primary importance to the proposed analysis A detailed denvation 
is given in the Appendix The whole transistor chain of Figure 2 1 (a) can be reduced to 
two transistors based on their behavior where in the top most transistor Mn is been left 
untouched and the remaining transistors mapped to an equivalent transistor as shown in 
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Figure 2 1 (b) The single transistor equivalent will be as shown in Figure 2 1 (c) The 
reason is as follows 

Let the voltage at the source of the top most transistor i e at the node n 1 be Vm 
which IS a function of time While the input is applied and considering all the internal 
nodes to be initially discharged all the transistors will be cut off at the beginning as the 
gate voltage of all the transistors will be less than the zero body bias threshold volt igc, 
Vto But owing to parasitic capacitances (as shown in Figure 2 1 (d) ) the voltages at the 
inlcnncdiatc nodes (0 to n 1) begin to rise due to the current Icmi At some time when 
the input voltage reaches Vto the bottom most transistor Mi goes into conduction by 
entering into the saturation region All the other transistors will be cut off as their source 
voltages are greater than zero (and hence less Vqs) due to the charging of the parasitic 
capacitances C, Now as the transistor Mi provides a discharge path for C, the voltage at 
node 1 begins to fall and also as the gate voltage is increasing a point will come when its 
dram source voltage will become equal to the gate source voltage and so it will enter the 
linear region At this very instant the gate to source voltage of the transistor above it will 
be Vto If we neglect the body effect then the transistor will enter into conduction by 
going into saturation at this instant Even with the body effect considered which will 
modulate the threshold voltage the upper transistor will enter into conduction somewhere 
near this point as the modulation of the threshold voltage will be very less 

This process will continue and when the top most transistor (and hence the whole 
chain) enters into conduction with the top transistor going into saturation all the bottom 
transistors will be in the linear region The top most transistor operates m the saturation 
region for some time and when its drain source voltage becomes equal to the Vdsat it 
enters linear region while the transistors below it always remain in the linear region 
throughout this penod So from the time at which the whole chain starts conduction the 
top most transistor expenences two regions of operation i e the saturation as well as the 
linear region while the transistors below it will always operate in the linear region All 
the bottom transistors as they behave identically can be replaced by an equivalent 
transistor and the top transistor which operates in two regions need to be handled 
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CMOS GATE MODELING 


differently So the whole chain is first collapsed to a two transistor equivalent with the 
top trinsistor left untouched and all the bottom transistors replaced by a single equivalent 
transistor and then they are collapsed to an equivalent transistor Since the idea is to find 
out the equivalent transistor by analytical method the equivalent transistor has been 
found out by observing the output waveform at the node n 1 which corresponds to the 
output of the lower n 1 transistors The effective width of the transistor is computed to 
get the same response The voltage of the output waveform from the time the whole chain 
startsconducting say// to the time at which the upper transistor enters linear region say 
t 2 has been found to be linear and rising to a voltage called as the plateau voltage 


2 2 PLATEAU VOLTAGE 

From the time interval dunng which the top most transistor (Mp) enters the 
saturation region and the input is rising its current and consequently the voltages at the 
internal nodes are increasing When the input reaches Vdd and till the time the top most 
transistor exits saturation (if it has not already done so) its current and hence the internal 
node voltages remain constant This is because the gate voltage is constant (Vdd) and so 
the drain current of the top most transistor which is m saturation will be constant 
Suppose the internal node voltages nse owing to a difference in the charging and the 
discharging currents of the parasitic capacitances then the voltage at the source of the 
top most transistor will also rise which will cause a reduction of its gate source voltage 
This will decrease its dram current and hence the charging current for the internal nodes 
So the voltage will drop again Now, suppose the voltage at the internal nodes try to 
decrease then due to the increase in the gate source voltage of the top most transistor, the 
dram current will increase charging up the nodes and thereby increasing their voltages 
So the voltages at the internal nodes will remain the same as long as the top most 
transistor remains m saturation after the input has reached the maximum value of Vdd 
T his state is known as the plateau state and dunng this time the same current flows 
through all the transistors in the chain Figure 2 2 (b) shows the occurrence of plateau 
voltage (Vp) and is the output obtained by SPICE 
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Now we have seen that for the plateau state to appear the top most transistor 
must still remain in saturation region even after the input has reached the maximum of 
Vdd This can obviously happen when the input ramp is fast enough that it can reach Vdd 
before the top most transistor exits saturation So the plateau voltage appears only for 
fast inputs Fast and slow inputs are classified according to when the time at which the 
top most transistor exits saturation (ta) occurs If ta > t then it is a fast ramp (shown in 
figure 2 2 (a) ) and if t 2 < t then it is a slow ramp (shown m figure 2 2 (b) ) For slow 
ramps the top most transistor enters linear region before the end of the ramp and the 
voltage at its source starts to fall before the plateau voltage can be reached Now the time 
at which the top most transistor enters linear region from the saturation region depends 
on when its dram source voltage drops below the Vdsat This depends on how fast the 
load capacitance Cl is discharging Suppose if the load capacitance were large enough 
that Vd sat is reached after the end of the ramp then the plateau voltage can be reached 
One observation that has been made by looking at the output of SPICE waveforms is 
that the slope of the voltage at the source of the top most transistor i e node n 1 remains 
constant between time ti and ta whatever be the load capacitance This is because we are 
considering the slope at a time when the top transistor is in saturation and in saturation 
the current depends only on the gate source voltage and not on the drain source voltage 
So the current will be the same in saturation for any load capacitance and so, the voltage 
at all the nodes below this transistor will follow the same characteristics for any load 
capacitance This has been shown in figure 2 3 (a) So whether the ramp is a slow one or 
a fast one we can always assume the load capacitance to be large enough and hence that 
the voltage at node n 1 reaches the plateau voltage 

It should be noted that the effect of the channel length modulation has been 
neglected m the calculation of this plateau voltage Since channel length modulation 
specifies that even in saturation the dram current is dependent on the dram source 
voltage considering it will cause increase the complexity and will result m iterative 
equations So, it has been neglected and the errors have been found to be insignificant 
when compared to the results of SPICE 
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In order to calculate the plateau voltage Vp at the source of the top most 
transistor of the chain consider the circuit shown in Figure 2 1 (a) The plateau voltage 
occurs at the end of the input ramp when V,n = Vdd and the current ceases to increase 
Thus V) can be calculated by setting the saturation current of the top most transistor 
(Mu) equal to the current of the bottom transistor (Mbeq) shown m Figure 2 1 (b) which 
operates m the linear mode 

Ksu(Vdd 0 (l+8)Vp)*^ = Kibeq (Vdd — Vto Mp (2 4) 

The equation can be solved with very good accuracy using second order Taylor 
senes approximation The solution has been explained in detail in Appendix 


2 3 EQUIVALENT WIDTH 

A single equivalent transistor will be able to replace a transistor chain if it 
manages to have the same output response as that of the combined behavior of the chain 
with the dual operation of the top transistor in saturation as well as linear region For the 
time interval that the top transistor is in saturation the current through the top transistor is 
the bottleneck for the current that is flowing through the complete chain So the 
equivalent width (Wcq) of the single equivalent transistor Mcq as shown in Figure 2 1 (c) 
dunng this time can be obtained by equating the current through the top most transistor to 
the current through the equivalent transistor 

Imh “ iMeq 

==■ =P.^{y^-rro) (25) 

The above equation can be solved for different values of t yielding corresponding 
values of Weq values For the time interval [ti, t 2 ] for which the top transistor operates in 
saturation Weq plotted against time has the form as shown if Figure 2 3(b) In order to 
find the average effective value of Weq, the time point ta needs to be calculated This can 
be done by solving the following equation at the output node n (detailed solution is given 
m Appendix II) 

Vd s ATn [ta] VouiCta] — Vm [t2 ] (2 6) 
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It must be noted that in the calculation of t the effect of the parasitic short circuit 
current of the PMOS transistor structure during the switching iction has been neglected 
This has been done purposefully to keep the mathematical complexity low The effect of 
this parasitic current is to add some more charge to the load capacitance and thus extend 
the saturation region This will affect the Wcq calculated and so to keep the complexity 
low instead of adding the expression of the current of the PMOS transistor structure the 
effective Weq has been chosen so as to compensate for these parasitics SPICE results 
h ive shown Ih it the bound on the saturation region due to these parasitic currents 
depends on the input slope and the load capacitance This can be the result of the increase 
in short circuit current as the input transition time increases and as the output load 
capacitance decreases A very good approximation found valid for a wide range of input 
slopes and capacitances is to calculate the Weq at time t = t 2 for fast inputs and at t = 
(ti+3 3t2)/4 for slow inputs where ti is the time at which the complete chain starts to 
conduct This Weq corresponds only to the region where the top most transistor is in 
saturation and so will be referred to as Wsat If there is no parasitic action of the PMOS 
chain then Weq is calculated at t = (ti+t2)/2 As this equivalent width has been calculated 
for the time when the top most transistor operates in the saturation region from now on 
It is referred as Wsat 

Wsai = Weq[t] 

Where t = (ti+3 3t2)/4 Slow ramps 

t = t 2 Fast ramps (2 7) 

When all the transistors operate m the linear region, the transistor chain acts as 
voltage divider with a uniform distnbution of the output voltage among all the 
drain/source nodes So the Weq for the time for which the top most transistor is m the 
linear region can be just calculated as the average of the widths 

A-s the effective width of the equivalent transistor for each region is now known 
the chain can be modeled by a single transistor whose effective width is Wsat from time ti 
to t 2 and for the rest of the time is Wim But as the aim is to provide an equivalent width 
that would match the existing inverter models the above two widths need to be merged 


I 
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into one This can be done by calculating the effective charge that is discharged to the 
ground in each case Then weights can be attached to each of the effective widtlis to 
calculate the overall equivalent width as 

Weq=Csat + C„n W,,n (2 8) 

Where 


Csat Qsat ! Qtotal 


C„n = l Qa, (2 9) 

Where Qtotai is the total charge that is stored initially m the output load and is 
given by 

Q.oiai = CL Vdd (2 10) 

Qsat IS the fraction of charge that is discharged to ground dunng the time in which 
the top transistor in the chain operates in saturation and is given by 

Qsat = Qtoiai-Q[t2] = Cl Vdd " Cl VoM (2 11) 


Note that in the above equations (2 5) and (2 6) Vm is a function of time In order 
to know the function Vm we need to know the points (Vi ti)and(V 2 t 2 )or(VpT) since 
Vm is a linear function of time passing through these points as shown in Figure 2 2(a) and 
Figure 2 2(b) Of these Vp has already been calculated and x is nothing but the time 
period of the input ramp So Vi and ti remain to be calculated and then Vm can be 
formulated as 

VM(t) = V,+m (t-ti) (2 12) 

Where m the slope of Vm is given by 

m = (Vp Vi)/(tti) (2 13) 


2 4 STARTING POINT OF CONDUCTION 

In a transistor chain with initially discharged internal nodes and the same input 
applied to the gates of all transistors the closer to the ground transistors start conducting 
first and then the top transistors follow So m order to be able to model efficiently the 
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chain by an equivalent transistor the starting point of conduction of the chain which is 
actually the staiting point of conduction of the top most transistor needs to be calculated 



Figure 2 4 Internal node voltage waveforms till the complete chain starts conduction 

The voltage waveforms at the internal n 1 nodes of an n transistor chain when the 
same input is applied to all the inputs are shown in Figure 2 4 When the input is applied 
initially all the transistors remain cut off as the gate voltage of all the transistors will be 
less than Vio But owing to parasitic coupling capacitances (as shown in Figure 2 1 (d) ) 
the drain voltages tend to follow the input ramp due to the current Icmi The coupling 
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capacitances arise due to the gate source and the gate dram overlap capacitances of the 
upper and lower tiansistors respectively and are given by 

Coverlap ~ W[Cgdo + Cgso] (2 1 4) 

where W is the transistor width and Cgdo and Cgso are the gate dram and the gate 
source overlap capacitances per micron which are determined by the process technology 


Until the time when the input voltage reaches Vto and when the bottom most 
transistor Mi goes into conduction by entering the saturation region the nodes are 
isolated between the two cutoff transistors So the voltages at all the nodes can be 
obtained by equating the current due to the coupling capacitance Icmi at the node i with 
the charging current of the parasitic node capacitance Ic, (Figure 2 1 (d) ) 

IcMi = Ici 


=>C 




dV -dV dV 
dt ‘ dt 


=> 






(215) 


Since the input ramp has been considered to have a time penod x at the time tsi 
when Ml goes into conduction the input will reach Vto So, tsi can be calculated as 

T 


^ I ~ ^TO 


(216) 


DO 


All the other transistors will be m cutoff as their source voltages are greater than 
zero due to these node voltages Now as Mi transistor provides a discharge path the 
voltage at node 1 begins to fall and also as the gate voltage is increasing a point will 
come when its dram to source voltage will become equal to the gate source voltage and 
so It will enter the linear region At this very instant the gate to source voltage of the 
transistor above it will be Vto So if we neglect the body effect then it will enter into 
conduction by going into saturation at this instant Even with the body effect considered, 
which will modulate the threshold voltage the upper transistor will enter into conduction 
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somewhere nearer to this point as the modulation of the threshold voltage will be very 
less 


After the time the bottom transistor Mi starts to conduct (tsi) and the time the 
complete chain starts to conduct (ti) the node 1 will be subject to two opposite trends 
One tends to pull the voltage at the node up due to the charging through the coupling 
capacitance which is intense for fast inputs while the other is to pull it down to 7 ero by 
the dischaiging cuirent through the transistor (i e its dram current) Initially Mi enters the 
conduction region by operating in the saturation region (since its Yds is greater than its 
Vgs which IS near zero) Since the gate dram coupling capacitance in will be small the 
charging rate will be less Due to the high rate of discharging the second of the trends 
dominates and the voltage at the node 1 will begin to fall This will continue till the time 
when the transistor Mi will enter the linear region (at ts 2 ) At this very instant the second 
transistor will enter into conduction as its gate source voltage approaches Vto From this 
time onwards the gate source capacitance of the second transistor and the gate dram 
capacitance of the bottom transistor will increase (This happens because the overlap 
capacitances depend on the region of operation The actual relation is given m 
Appendix), and so the rate of charging will also increase It has been observed by SPICE 
smiLilations that during the time at which the second transistor enters into conduction 
(saturation) and the whole chain starts to conduct, these two opposite trends seem to 
balance each other and the node voltage almost remains constant At the time when the 
third transistor enters into conduction the same process happens at the node 2 and m 
general at all the internal nodes the same process happens and the output waveforms 
will be as shown m Figure 2 4 

This process will continue and when the top most transistor (and hence the whole 
chain) enters conduction by going into saturation at time (tsn = ti) the node voltages 
begin to rise Another observation which has helped m getting at the value of ti is the 
slopes of these internal node voltages As can be seen from the graph which has been 
venfied by SPICE the slopes of the node voltages dunng the time [ts, ts,+i] are almost 
same for all the transistors and so this assumption has been considered which is a safe 
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one without ending up in significant errors This slope say m can be den\ed by solving 
the differential equation which results from the Kirchhoffs current law at node 1 (Figure 
2 1 (d) ) 

iMI == ICMl ICl 


'^s (V -yro) = C 


A/ 1 


dV dV, 


dt dt 


-C. 


dF| 

dt 


( 217 ) 


Here for simplicity, the velocity saturation index a has been considered to be 


one 


To get the average slope say r of the node voltages dunng the time [ts, tji+i] the 
time points tji ts 2 and the corresponding dram voltage (node voltages at the node 1 for 
these two time points) need to be calculated We already have tsi from equation (2 16) 
and the corresponding node voltage from equation (2 15) Since at time tsz the second 
from bottom transistor M 2 starts conducting the time ts 2 can be calculated by solving the 
equation 

VGs[ts2]=VTN2[ts2] (218) 

So from these values r can be calculated Similarly by using the equation (2 1 8) 
the time points at which the other transistors enter into conduction can be calculated The 
starting point of conduction of the whole chain i e ti can be calculated by solving the 
following equation 

VGs[ts,] VTN,[ts.l = 0 

=> (Vi„[t,i] Vs[t„]) - (Oo + 5oVs[t„]) = 0 (219) 

Where Vs[tsi] is the voltage (with respect to ground) at the source of transistor i at 
time tsi and can be calculated as 

t'-' ■) (220) 

The constants Oo and 5o are the ones obtained from the equation (2 3) for Vsb = 
Vto for higher accuracy because the node voltages he close to Vto and not around 
0 2Vdd as in equation (2 3) 
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Equation (2 19) results in a recursive expression for tji and is given as 


00 +(1 + 00) 


'kf I 


DD 


V ^kfi i + ^ 


+ r 


+(l + 5o)rr 


i>2 


(2 21 ) 


And ti will be equal to tsn Thus having obtained ti all the above equations can be 
solved and thus the effective width Wcq of the equivalent single transistor that can replace 
the complete chain can be obtained 

2 5 DISCUSSION 

It should be noted that in the calculation of ti while calculating the value of ts, 
and thus ti the effect of body voltage has not been considered Due to this some error in 
the calculation of ti arises and as the number of transistors in the chain increases this 
error increases because the body effect affects the values of all ts, values and hence as n 
increases the error introduced also increases So though the method of finding the value 
of ti as discussed above is a simplified approach and finds the value analytically it has 
resulted m errors which could have been avoided had the body effect been considered 
In the next chapter analysis has been done to find out the value of ti more accurately by 
considering this body effect Also while doing this computation care has been taken to 
see that the extra calculations involved do not reduce the efficiency of the overall 
computation, since the idea is to implement it into a fast timing simulator 
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Chapter 3 

IMPROVED CMOS GATE MODELING CONSIDERING 

THE BODY EFFECT 


3 1 BODY EFFECT 

During fabrication all the MOS devices are fabricated on a single substrate As a 
result, the substrate voltigc of all the devices will be the same So when a number of 
transistors are connected in series, the source substrate voltage of all the transistors will 
not be equal and it will increase as we proceed vertically in the chain During the 
operation of a MOS transistor when Vqs is greater than its threshold voltage the 
depletion width remains constant and charge earners are pulled into the channel from the 
source But as the substrate bias is increased the width of the channel substrate depletion 
layer also increases resulting in an increase in the density of the trapped earners in the 
depletion layer to maintain the charge neutrality the channel charge must decrease The 
result IS that the body voltage adds to the channel substrate junction potential This 
increases the gate channel voltage drop The overall effect is an increase in the threshold 
voltage Therefore the threshold voltage is no more constant with respect to the source 
substrate voltage This effect of modulation of the threshold voltage by the substrate bias 
IS called as the substrate bias effect or the body effect The relation between the threshold 
voltage to the substrate voltage is given in Appendix I and has been used in equation 
(2 3) 

3 2 EFFECT ON THE STARTING POINT OF CONDUCTION 

In the analysis carried out in the previous chapter, the body effect had been 
considered for calculating the Vtn for all the transistors in the transistor chain But m the 
modeling done for finding out the starting point of conduction the idealization of the 
output that has been done as in Figure 2 4 is valid only if the body effect is not taken 
into consideration In the Figure 2 4 the transistor M 2 has been said to go into conduction 
exactly at the time the bottom transistor enters linear region from the saturation region 
This means that the threshold voltage of this transistor is equal to Vto i e the body effect 
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has not been considered or that its body substrate voltage is zero This should mean that 
the SLibstiates of all the transistors are tied to their respective source terminals and this 
will result in a circuit as shown in the Figure 3 1 (a) The waveforms at the internal nodes 
of such a chain obtained by SPICE level 3 simulations are shown m Figure 3 2 (a) From 
this giaph It is clear that the waveforms are falling linearly during the time [tsi ts,+i] and 
exactly at the time the node voltage becomes constant the transistor above it is entenng 
into conduction This validates the idealization done in Figure 2 4 



Figure 3 1 (a) Cham of 4 transistors with substrate of each transistor connected to its 
source terminal (b) Chain of 4 transistors with a common substrate which is grounded 
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(b) 


Figure 3 2 (a) Output of a chain of 4 transistors without considenng the body effect 
(b) Output of a chain of 4 transistors considenng the body effect 
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But in practice the type of chains that are encountered are of the form shown in 
Figure 3 1 (b) wherein the substrate for all the transistors is common and so is at the 
same potential This causes the threshold voltage of all the transistors to be different and 
it increases as we move up the chain due to the increasing source body voltage When 
the bottom transistor enters the linear region the Vrs of the transistor above it will be 
equal to Vto, but still as its Vtn will be greater than Vto it will not enter into 
conduction The result is that the bottom transistor will operate in the linear region for 
sometime dining which its di iin voltage will be still falling but at a different rate before 
the transistor above it enters into conduetion Once the upper transistor starts conducting 
then the analysis mentioned m the previous chapter holds good and the dram voltage of 
the bottom tiansislor will almost become constant During the time that the bottom 
transistor is in linear region and the transistor above it is not conducting the rate of fall of 
the dram voltage of the bottom transistor will be different from the rate at which it was 
falling when the transistor operated m the saturation region The result is that the linear 
approximation taken m the previous chapter for this region is no longer valid To get 
accurate results the region will have to be broken up into two different parts and modeled 
scpantcly with the first region modeled as before with the bottom transistor m 
saturation and the second region modeled with the bottom transistor m linear region The 
output of the internal nodes of a 4 transistor chain with a common substrate, as shown m 
Figure 3 1 (b), obtained by SPICE level 3 modeling is shown m the Figure 3 2 (b) The 
late occurrence of the starting point of conduction of the transistors in the chain from the 
time the bottom transistor exits the saturation region is pretty clear from the graphs 

3 3 PWL MODELING FOR STARTING POINT OF CONDUCTION 

In the previous analysis we had assumed that the transistors operate only in the 
saturation region from the time they start conduction to the time the transistor above it 
starts conduction This helped in increasing the efficiency of the computation because in 
saturation the current as well as the parasitic capacitances is independent of the dram 
source voltage and what we are trying to find out is the dram source voltage itself (node 
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voltaj^es) Once we divide the region into two regions we can again easily calculate the 
V lines rcl ited to the situration region But m the calculation of the values related to the 
lineu legion the computations become iterative because the dram current and the 
parasitic capacitances (Refer Appendix I) are now dependent on the dram source voltage 
which is exactly what we want to compute In doing so though the accuracy is increased 
we have to sacrifice lot of computational efficiency Instead of going for iterative 
expressions we can make some assumptions which can reduce the equations to ordinary 
qiiadntic cxpiessions without sacrificing much of the efficiency The assumptions that 
have been taken are not just taken to fit to the data but have been done by studying the 
output response of the waveforms obtained by SPICE 3 simulations By looking at the 
wavefonns of Figure 3 2 (b) it can be seen that the slope of the waveform from the time 
a transistor goes into conduction by entering into saturation to the time it goes into the 
linear region is same for all the transistors (same as the assumption taken m the previous 
chapter) Similarly the waveform from the time the transistor enters the linear region to 
the time the transistor above it starts conduction can be approximated by a straight line 
and also the slope for all the transistors is nearly the same This simplification will lead to 
the reduction of the Figure 3 2 (b) to that shown in the Figure 3 3 for a general n 
transistor chain 


So if say that the average slope of the waveform for the time [ts, ti,] is r and that 
for the time [hi fii+i] is s, then we need to calculate the voltage at the dram of the bottom 
transistor for the time points t^i t,i and tsj and from these r and s can be obtained Once 
we have these values then using them, we can easily find out the time point t\ So the 
PWL approximation of the output waveform at the nodes of the transistor chain and the 
assumption that the average slopes for the time [tj, h,] and [t|, ts.+i] are same for each 
node, helps a lot in simplifying the complexity involved in the calculation of the starting 

point of conduction 
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Figure 3 3 PWL of the internal node voltages till the complete transistor chain starts 
conduction 


3 4 APPROXIMATION OF PARASITIC CAPACITANCE 

Another problem involved with this two region modeling is the dependence of the 
parasitic capacitances on the dram source voltage which will also result in iterative 
expressions The reason is that the node voltages are obtained because of the charging of 
these parasitic capacitances and these again depend on the dram voltages To reduce this 
complexity and avoid sacnficmg the efficiency another assumption is made for the 
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c\lcuhtion of the capacit'inces in the linear region le for the time [ti, ti+i] The 
c ip icit incc C| it the nodes of the transistors as shown in Figure 3 I (b) arise due to the 
parasitic dram substrate and the source substrate capacitances which depend on the dram 
source voltage when the transistor operates m the linear region The dependence is shown 
in Appendix I In the saturation region the value of the capacitance Ci is independent of 
the dnm source voltage and becomes equal to 2/3(ai l)Cox (Refer Appendix I) but 
during the linem region it will keep on increasing and for the case of the drain source 
voltage becoming zero the value becomes equal to (ai l)Cox 

In the linear region the dram voltages of all the transistors will be very less by the 
time they start operating m the linear range which will result m C, value close to (ai 
l)Cox This will increase further to reach the final value of (ai l)Cox when the node 
voltages become zero So the node capacitance value has been approximated to (ai 
l)Cox fot II"*® linear region This approximation leads to some loss of accuracy while 
allowing better efficiency because the iterative expressions reduce down to ordinary 
equations 


3 5 CALCULATION OF STARTING POINT OF CONDUCTION 


As mentioned earlier we need to calculate the average slopes r and s m order to 
calculate the starling point of conduction t| For the calculation of r we calculate the 
value of the dram voltage of the bottom transistor for the time points tsi and tli as shown 
in Figure 3 3 The charging/discharging of the parasitic capacitances will be the same as 
shown in the Figure 2 1 (d) Time point tsi is the time when the bottom transistor enters 
conduction and so will be the time at which the input ramp reaches the value of Vto So, 
tsi and the corresponding dram voltage say Vsi will be given by 


t 


SI 
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c, 
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SI 




TO 


( 31 ) 
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The time point tn is the time at which the bottom transistor enters the linear 
region It can be calculated by applying the Karchoff s current law equation at the node 1 
of I iguie 2 1 (d) and using the condition that at this time the dram source voltage of the 
transistor will equal its (Vqs Vt) A detailed derivation is given in Appendix III 

U=irMi~ici (3 2) 




‘IV,. ‘iv„ 


-c. 


dV. 


S\ 


clt 


(3 3) 


(It dt 

Fiom tii the value of the dram voltage at that time say Vn can be calculated and 
so r will be given by 

r = (Vsi VH)/(t|, tsO (3 4) 


I 01 cilcLilalmg the viluc of s we need the value of drain voltage of the bottom 
transistor at tlie time points tn and ts 2 For this time the transistor will be m the linear 
region and so by solving equation (3 2) with the expression of the dram current for linear 
region and using the condition that at time tsi the transistor above the bottom transistor 
goes into conduction, ts 2 can be calculated 


‘=^K,{V,s-V,o) 


an 


Vos = Qn 


dV„ 
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dt 

clt 

dt 


(3 5) 




(3 6) 


Wliere Vsi' is the dram voltage of the bottom transistor corresponding to 
time tsi Detailed derivation is given in Appendix III 


Now, s will be given by 

s = (Vsi’ Vn)/(ts 2 tn) (3 7) 

By generalizing the equations (3 5) and (3 6) a recursive expression for finding 
out the value of h, can be obtained as 
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t, 
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I I ^ +1 
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(3 8) 


Wheie, tsi IS given by 
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(3 9) 


By solving these recursive expressions we can find out the value of ti which is 
nothing but tsn T he v ilucs of ti ne c ilculatcd by this procedure and those obtained by the 
analysis carried out in the previous chapter have been compared with the results of 
PSPICE level 3 model for a transistor chain with 4 and 6 transistors and are shown in 
riguic 3 4 (a) and (b) The plots clearly show the increase in the accuracy of calculation 
of the starting point of conduction of the chain that is obtained by the new analysis Also 
it can he seen that the approximations that we have taken for simplifying the equations 
have not mtiodiiced significant errors 

The lemaming analysis has been earned out as discussed in chapter 2 for finding 
out t 2 and Vp and the has been calculated at t = t for both the fast and the slow 

ramps 


The results obtained for the equivalent inverter by calculating the values of Weq 
by using this new analysis for the starting point of conduction are far more accurate than 
those oblatncd in the previous ense A detailed discussion on the results has been 

presented m chapter 4 
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(a) 



(b) 


Figure 3 4 Comparison of starting point of conduction for (a) n - 6 (b) n 4 
for different values of nse time for the input ramp ( Ct—lOOfP 5u W— 8u) 
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Chapter 4 

RESULTS AND DISCUSSIONS 


4 1 RESUI I S 

As the iccui icy of the equivalent width depends on how accurately we model the 
voltage at node n 1 i e the voltage Vm the gained accuracy m the calculation oft] and 
thus Vm will reflect onto accuracy of Weq The improvement gained m the calculation of 
the starting point of conduction can be seen from the Figure 3 4 As the idea is to get the 
effective width of the tiansistor chain and thus the inverter which can be then simulated 
using Hist oiclci models the viluc of the equivalent width that is obtained has been 
incoipoi Ucd into PSPICF level 1 invcitci model and the accuracy verified The increase 
in iccui icy is evident fioni the Figure 4 1 where some outputs have been shown relating 
to diffcient input tiansition times and lengths for a 4 input NAND gate In [1] the results 
of equivalent inverter weie incorporated into ILLIADS inverter model and the outputs 
complied to a SPICF 3 model But since ILLIADS was not available and as it is a 
pi iniitivc level 1 model, the icsults h ive been verified using the PSPICE level 1 model 

Since in a timing simulator, the nidin idea is to compute the delays the delay 
obtained by PSPICE level 3 model of a complete NAND gate with different number of 
inputs, input trinsition times and three different lengths has been compared with the 
delay obtained by incoiporatiiig the calculated effective width into a PSPICE level 1 
model, and shown in Tables 4 1 4 2 and 4 3 The percentage errors calculated have also 
been shown and fiom these the increase in the accuracy obtained due to the 
consideration of the body effect in the calculation of the starting point of conduction can 
be seen 


In Older to verify the accuracy of this improved analysis two more complex gates 
and the AOI1221 have been taken and the outputs compared to that of PSPICE level 3 
outputs for the same circuits 
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^NEW 

\^psiicr 


FigUie 4 1 Comparison of output wavefoims of PSPICE level 3 model for Gales and 
the equivalent invciter foi L=0 5u (a) N=2 t=lns (b) N-2 i 2ns (c) N 2 t 4ns (d) 
N=2 T^Sns (e) N=4, T=2ns (f) N=4 x=4ns 
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1 AUI F 4 1 Plop igation Delays (in ns) and Percentage Errors for PSPICE level 3 model 
Method IS pioposed in [1] (Old) and the Proposed Method (New) of NAND gate with 2 
4 ind 6 inputs foi diflctcnt input tiansilion times and Length = 0 5u(W=8u C|=100fr) 


N 

X 

SPICF 

01 D 

%rRROR 

NFW 

%FRROR 

2 

0 5 

0 256 

0251 

1 95 

0 250 

2 34 

2 

1 0 

0 347 

0 370 

6 63 

0 350 

0 87 

2 

20 

0 501 

0 602 

20 16 

0 533 

639 

2 

40 

0 774 

0 991 

28 04 

0 851 

9 95 

2 

8 0 

1 218 

1 734 

42 36 

1 411 

15 85 


N 

X 

SPICI 

01 1) 

%CRROR 

NEW 

%ERROR 

4 

0 5 

0 420 

0 404 

3 81 

0 370 

1190 

4 

1 0 

0 504 

0 504 

0 00 

0 501 

0 60 

4 

2 0 

0 789 

0 852 

7 98 

0 771 

2 28 

4 

40 

1 192 

1 380 

15 77 

1 220 

2 35 

4 

80 

1 923 

2 361 

22 78 

2 039 

6 03 


N 

X 

SPICE 

OLD 

%ERROR 

NEW 

%ERROR 

6 

05 

0 637 

0 547 

14 13 

0 484 

24 02 

6 

1 0 

0 739 

0 709 

18 00 

0 638 

13 67 

6 

2 0 

1 000 

1 049 

49 

0 963 

3 70 

6 

40 

1497 

1 686 

12 63 

1452 

3 01 

6 

8 0 

2 397 

2 623 

9 43 

2 435 

159 


39 
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FABI ^ 4 2 Piopigition Del lys (in ns) and Percentage Errors for PSPICE level 3 model 
Method IS pioposcd m [I] (Old) and the Proposed Method (New) of NAND gate with 2 
4 ind 6 inputs foi dilfciuit input it msilion times and Length = 0 35u (W=8u CL=100fF) 


N 

T 


01 D 

%FRROR 

NEW 

%FRROR 

2 

05 


0212 

2 91 

0 207 

0 49 

2 

1 0 

0 282 

0 336 

19 15 

0 305 

8 16 

2 

20 

0 441 

0531 

20 41 

0 471 

6 80 

2 

40 

0 673 

0917 

36 26 

0 757 

12 48 

2 

8 0 

1 094 

1 638 

49 73 

1 298 

18 65 


N 

T 

SFICI 

01 D 

%rRROR 

NEW 

%ERROR 

4 

05 

0 316 

0311 

1 58 

0 295 

6 65 

4 

1 0 

0 445 

0 477 

7 19 

0 432 

2 92 

4 

20 

0 666 

0 758 

13 81 

0 672 

0 90 

4 

40 

1 055 

1268 

20 19 

1 096 

3 89 

4 

80 

1 745 

2 205 

26 36 

1 893 

8 48 


N 
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SPICE 

OLD 

%ERROR 

NEW 

%ERROR 

6 

0 5 

0 440 

0 414 

5 91 

0 369 

1614 

6 

I 0 

0 508 

0510 

0 39 

0 503 

0 98 

6 

2 0 

0 836 

0 938 

12 2 

0 807 

3 47 

6 

40 

1 308 

1 535 

17 35 

1 314 

0 46 

6 

8 0 

2 183 

2 652 

21 48 

2 254 

3 25 
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TABI F 4 1 Prop igation Dehys (m ns) and Percentage Enors for PSPICE level 3 model 
Method ns proposed in [1] (Old) and the Proposed Method (New) of NAND gate with 2 
4 ind 6 inputs (oi different input tnnsition times and Length = 0 25u (W=8u Ci =100fF) 


N 

T 

SPICF 

OI D 

%FRROR 

NFVV 

%FRROR 

2 

05 

0 171 

0185 

8 19 

0175 

2 34 

2 

1 0 

0 241 

0 294 

21 99 

0 263 

913 

2 

20 

0 365 

0 483 

32 33 

0 416 

13 97 

2 

40 

0 586 

0 858 

46 42 

0 701 

19 62 

2 

8 0 

0 993 

1 568 

57 91 

121 

21 85 


N 

t 

spicr 

OI D 

F., ERROR 

NEW 

%ERROR 

4 

0 5 

0 256 

0 27 

5 47 

0 248 

313 

4 

1 0 

0 385 

0 421 

9 35 

0 381 

104 

4 

20 

0 580 

0 694 

19 66 

0 605 

431 

4 

4 0 

0 947 

1 179 

24 50 

1 012 

6 86 

4 

80 

1 613 

2 089 

29 51 

1 785 

10 66 


N 

T 


OLD 

%ERROR 

NEW 

%ERROR 

6 

0 5 


0 344 

0 00 

0 300 

10 18 

6 

1 0 

0 482 

0 502 

1 04 

0 469 

2 70 

6 

2 0 

0 726 

0 844 

16 25 

0716 

1 38 

6 

40 

1 186 

1425 

2015 

1204 

152 

6 

8 0 

2 023 

2 503 

23 73 

2 129 

9 69 
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RESULTS AND DISCUSSIONS 


Two moic complex gntes is shown in the Figure 4 2 and Figure 4 3 have been 
taken md the results hive been tabulated in the Tables 4 4 and 4 5 An AOI1221 gate as 
shown in Figuic 4 4 h is ilso been tikcn and the results tabulated m Table 4 6 All these 

ex implcs show the inciease in the iccuracy of the computation of delay when compared 
to the pievious inilysis is discussed in [1] 


Vdd 



Figui c 4 2 Complex Gate 1 [i(b+c)]' (Ci = lOOfF) 


T able 4 4 Propagation Delays (m ns) and Percentage Errors for PSPICE level 3 model 
Equiv ilcnl NAND gate level 3 model and the Equivalent inverter level 1 model for 
diffeicnt input transition times and Length = 0 5u 


T 

PSPICE 

EQ NAND 

% ERROR 

EQINV 

% ERROR 

05 

0 234 

0 212 

9 40 

0 204 

12 82 

1 0 

0 320 

0 285 

10 94 

0 288 

10 OO 


0 423 

0 393 

7 09 

0 405 

4 26 

40 

0 626 

0 572 

8 63 

0 627 

016 

80 

0 942 

0 827 

1221 

0 971 

3 08 
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V,m 



Flguic4 1 CompluGUt.2 AOI1221 ciicuit(CL= lOOlF) 


Tabic 4 5 Fiop\gUion Delays (in ns) ind Pcicenlagc Enois loi PSPICE level 3 model 
Equiv ileiil NAND gue level 3 model ind ihe Equivalent inveitei level 1 model loi 
diKetenl input ti insition times and Length = 0 5ii 


X 

PSPICE 

EQNAND 

% ERROR 

EQ INV 

% ERROR 

OS 

0 403 

0369 

8 47 

0 320 

20 59 

1 0 

0 St)4 

0 484 

3 97 

0 437 

noy 

2 0 

0713 

0 676 

5 19 

0 648 

9 12 

4 0 

1 0S8 

0 99S 

5 95 

1 020 

3 59 

80 

1 669 

1 564 

6 29 

1 680 

0 66 


4 ^ 





























RESULTS AND DISCUSSIONS 


4 2 CONCLUSIONS 

Simul Uion IS one of the key design validation tools that are required while 
designing complex intcgntcd circuits both for checking a circuit s function as well as 
pel foim nice 1 heic ue two methods of doing the tuning analysis for checking the delay 
pnimeteis ol my VI SI ciicuils One of the methods is to analyse the voltage and 
cun cuts tl etch and evciy node and do all the computation and get the result This 
method IS the one employed by simulators such as SPICE and is the most accurate one 
but the ptoblcm is that for VLSI circuits the time taken by such simulators is not 
affoidiblc paiticulaily for intermediate design checks The other method is to use less 
iccui itc models to gun moic efficiency in time but by compromising some of the 
icciir loy Much lescatch h is been focussed to develop new techniques that can enhance 
the tccui icy by using the simplci models itself there by improving both the accuracy as 
well as the efficiency One such technique has been presented in [1] But in the process of 
Imcaiistlion of the output wavefomis to simplify the computations the body effect had 
been ovci looked which has resulted in errors that could have been otherwise avoided 


In this work the complete modeling mentioned m [1] has been redone with the 
consider Uion of the body effect on the threshold voltage and this has resulted m much 
more accurate icsuUs with only little increase m the complexity All the results shown in 
this chapter, cleai ly show the improvement gained by incorporating the body effect m the 
cilculation of the starting point of conduction of a transistor chain while analysing 

complex CMOS gules 


4 3 FURTHER SCOPE 

It can be seen from the results that the output obtained by this new analysis is 
giving more accurate results for slower ramps than the faster ramps This is happening 
because m the PWL ipproximation that has been done to arrive at the simplified 
waveromi diagram of Figure 3 3 the waveform has been shown to be always falling from 
the time to t,.i with different rates of r ands But for faster input ramps the rate o 
charging of Ibe node parasitic capacitances will beeonae greater than the dram ciirren o 
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Ihc ti insisloi btlow it it the time the tnnsistor just enters into linear region from the 
satuntion icgion This will sometimes result in an increase of the node voltage instead of 
the dccicisc is has been shown So this will result in some error in the calculation of t| 
with fist input nmps This cm also be seen from the Figure 3 4 where the value of ti 
m itches nioie to tint obt lined by PSPICC simulations for slower ramps So if this region 
could be modeled moie 'icciiiately then the errors can be reduced further than obtained 
by the cm i cut correction However this will on the other hand increase the complexity of 
the algorithm and thus will reduce the efficiency 



APPENDIV I 


APPENDIX 1 

1 Moduhtion of threshold voltige due to the Body effect [7] 

ViN=Vio 

Whcic Vio IS the zero body bias threshold voltage and y is the body effect 
coefficient 


2 Intrinsic c ipncitances for a 4 terminal MOS transistor[7] 

Cgb 



“ Co 


fl(l + 2 t]) 
3(1 +l|)= 




C«d"C 






Cgid 


a,-l^ (l-n 




3a, 


•'OX 


1+71 


Where, vf 


I ^ Vds^Vdsat 

^ n 


DSAT 


Vds> Vdsat 


and ai= 1 + 




SB 


1 
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3 Coupling capacitance m Figuie 2 1 (C) W 

Whcic Cj^do me the gate source and the gate dram overlap capacitances 

4 Node c ipacit mce in Figure 2 1 (C) C, = 


n 
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appendix II 

1 Douvation foi Vin (equation 2 3) 

V , N = Vi o + Y ^ ^ ] 

L\p 'Hiding using Tayloi senes around Vsq = 0 2Vdd 
V iN= VinI.,, oa,,, + V'™l,^ o2.„AVsb-0 2Vdd) 


=. V™ = V,o+y(V02P^-^)+,^.i__(^ _ . 


“O + evsB 


where 


0 = V ro + 7 


^ + 2^0 -2^0 lV,^ri<Pn + (I>I 


uid 5 


y 


2V0 2I^»+A 


2 Deuvation foi plateau voltage Vi (equation 2 4) 

IMu ~ Ibtq 

==^K^K ~B-(S*S)V,'t=K^{Vno-r,of'\ 

Expanding L II S using Tayloi series upto Uie second order and rearranging tlie 

tCUllb, 




V: 


a 


(a-0 i 




+ 2rs„7,»-<?)“=0 


It IS of the form aVp^ P bVp Pc = 0 
~b±^b^ ~ 4ac 


So, V„ 


la 


in 
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3 Cqu Uion for cvaknting W^q (equation 2 5) 
Imn ~ Tmcc] 


> -fl-(i+ap.,T 


\ I'.-Vm J 


=■ W„,=W„| 

Where Vm Vm and Wcq are functions of time 


4 Solution for finding t 2 

dV, 


QZUL 
dt 


K, (f„-0-(1 + sK,)“ 


putting VM-Vi+m(t ti) & V,n=— r 

T 

=>c,^ = -^,„{-o-(i+sX^, -»",)+ 

Where «, = -0 -(I + SXf, - wr,) 


F, 


DD 


(1 + S) 




_ ' DD 


{'+«) 


l/?l 


Integrating fiom t = ti to t = ta and using the condition that at t = ti Vgs= Vj 

V„(lj) V„(l,)=-^fa4^2l2^ (1) 

C, aj.[a + l) 

Also, at this very instant Mn enters linear region from saturation region 
=> Vdsh = VnsAIn 

=> Vo(l2) - VM(t2) = V,„(t2) VM(t2) - [0 + 8VM(t2)] 

Vo(t 2 )= ■^’^^2-0-5(F, + wa(tj-r,)) 


IV 
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Putting in (n) nncl usm^ Ihe senes expansion it will become of the form of 
h bx^ ^ cx + d = 0 

where 

i==Ks ii^'*'’N.^a(a 1}/6 Cl 

b = Ks'l/“'^2a/2CL 

c = Ksai“/Ci + Vdd/t 5m 

d = Ksni^«"'V[n,C( (a+1)] Vnn 0 5(V, mt,) 

The solution will be 

b 2"'(3«c-l,>) 

3a 3a(p + qy' 2(2''^ a) 
whcie, 

p = ~2b^ + 9ahc - 21a^d 
mid q = ■\l4(3ac~I>^J + p^ 


5 Solution foi finding ts 2 (equations 2 17 & 2 18) 
At t = / M 2 enters conduction 


^ ?2 “ ^ 7^2 






/ — 0 
*52 ^0 

r 


(i+«,) 

from equation 2 17, 

=c 


— (a) 


Afl 


Uv>M) 


di 



dt 


(c,n = 


Assuming a - 1 and integrating from t = tsi to ts 2 
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Usint, ( 0 

^ r 9r ’’ 1 + S„ T T 




’7mj_L^1 - - /CiFrof,, + ^ + (Q/i + Q^il^si) 

l + (0 ^ ^ 


TO 


= 0 


Where ^)=77~'‘' 


DO 


And F,(ly,) = 


C 


ill 


TO 


11 IS 1 qn rdr^Uc cqintion of the form ax^ + bx +c -0 


So ( 1^2 


-h±\lb^ --4flc 


2a 


Putting m ( i) wc get 7 (^ 2 ) 
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1 Solution ol equation 3 1 




V, 


1)1) 

r 




U l = Is 


C 


Ml 


V 1 + C", J 


10 


Vosi = ViNl 




/)/) 


/S2=(l4 (1 I (5)\/, 


V.'. = 






(1 + 5) 

At t = ti Ml cnlus line u legion 
V|)s =Vgs - Vti 


H,= 


f y ^ 

Im-t -y 

'll *^70 
^ j 


( 1 ) 


2 1o lind 111 ( tqu lUon 3 3) 

Assuming a = i 


An 




clt cll 


' cll 


=> Cy-^=C,,^-Ks^t + K,V„ 

( ^ Vrin . Y7 V. . \ rf 


l)t) 


Ml * Q)(1^/I ^Ml y'^^S^TO p2 ^Sl) 

T I T 


fa 


Putting (1) in (2) 


(C^^l + -V;!, - Vs, j- Cwi ~^+ (/,2 /51) 


Vk 

T 


Re aiianging the teims, it is of the form of ax^+bx+c = 0 



A1 1 UNUIA 111 


Whcic 


a = 


Ks 


V 

^ nn 
2 t 


^ T 


TO 




2r 


til 


-b±^fh -4cic 


2a 


3 lo find Is-) (tqu Uion 3 5) 

lii<l<ls'> M| IS in liiieai legion 

>1=1 ml - I I 


=> A, 

( V 

h^LLi^y 

1 

rx„j 


1 ^ J 


T 


-(C.„+C,)^ (3) 
at 


Wt h ivt 




V ~V 
'si wi 


=:j> V,', =V1, + s(/,j-/„) 


(4) 


V, 






(1 + -5) 


fv ^ 

„ 's '^/O 

II 

V 1 

* UD 

1 T 

1 1 

X 


Now liom (3) K, 


PuUing Is-) =-;4i_.(i9 + (1 (jiom 5) 

''^iw 


(5) 

“ ' (t^An 4 C, ) (6J 


/f,(£)-V',„ + (l I AK,K,=-t^-)(C„,+C,) 


fiom (6) 


Ki 

fv ) 

v' =C 

\ 

Q 

' 

1 


[ ^ ) 


L ^ J 


' (^A( 1 + ) 


using (4) 


Vlll 



U LJ 


I 


K, 


V. 


Dl) 




^7 0 ihi 


Ml 




DD 


(< -'»)-(''s'.-'',o)(C.„+C,) 


usint (‘i) 


K, 


V. 


f' y t ^ 

''IW'SI Q ' 


niLi _ V 
's2 


(c„, + c,j 


(y t 

I nil's 2 0 


(1 + 5 ) 


WO 


(7) 

Solving (7) foi ts 

It IS ol tho loim ix"' +bx +CX +d = 0 

f^iVnn 

(H d)v 

K,y,„6 K,t,y^,, K,v„y,, 

(l I (5)r (l + 5)r^ (l + 5)r 

/ - ^A/i^wj , (^1 ''*~^mi)Kdd I ^i^hynn , , ^thy 1111 ^ 1(1 

T (l + 5)r (l + 5)r (1 + 5) (l + (5)r 

■/ - (Q'^^mi)^ I Qn^/i^w) (p I ^ W ^fihyi II 

(I I 5) r (II*) 


SO solution loi X is 

_ b 2''^('l«c-//) (/; + {/)'^^ 
3a 3a(/7 + 7/)''^’ 3(2''\{) 


p ~ -2b^ + 9ahc - 21a d 


q = ^4(3ac -//) t- /7^ 


4 To find ti, (tqu Uion 3 b) 

At t = tp M-) tiucis linen icgion fiom sU 

Vps'’ “ Vos'’ •“ VtN2 

V, =v,(/, 


IX 


AFPbNUlXlII 


But V, 


n 


C, 


M2 


Qo + c. 


DD 


+ ^(^/2 -^52) 


=> t 


12 


But 


0 + 0+5)!/,', 

C„2 V„„ ) 

'S2 

+ Q T 


(V ' \ 

1 ^ ' J 



0 + (l+SK = 


V, 


/)/) 


‘i2 


hi" hi 


I + 


( c ^ 

'-A/ 2 

C: 

1 

. ^M1 + ^2 , 

T 



\ 

/ 


h 


1 + 


c. 


c„ +c 


V, 


0!) 


fv, 


DO 


-r 


5 lo iincl Isi (equation 3 9) 


at t = ts 2 


VgS2 -VtN2 = 0 

^(„-0-(l + 5K'=O (8) 

T 

Now V(y| = V,| + ^(^^2 - ) 


But 
Putting in (8) 
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\h^Ml'^^l 
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DD 


h\ ' ih\ 
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d + {i+5]^^-^ 
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DD 
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